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Abstract — We present controllers that enable mobile robots 
to persistently monitor or sweep a changing environment. The 
changing environment is modeled as a field which grows in 
locations that are not within range of a robot, and decreases 
in locations that are within range of a robot. We assume that the 
robots travel on given closed paths. The speed of each robot along 
its path is controlled to prevent the field from growing unbounded 
at any location. We consider the space of speed controllers that 
can be parametrized by a finite set of basis functions. For a 
single robot, we develop a linear program that is guaranteed 
to compute a speed controller in this space to keep the field 
bounded, if such a controller exists. Another linear program is 
then derived whose solution is the speed controller that minimizes 
the maximum field value over the environment. We extend our 
linear program formulation to develop a multi-robot controller 
that keeps the field bounded. The multi-robot controller has the 
unique feature that it does not require communication among 
the robots. Simulation studies demonstrate the robustness of 
the controllers to modeling errors, and to stochasticity in the 
environment. 

I. Introduction 

In this paper we treat the problem of controlling robots 
to perpetually act in a changing environment, for example to 
clean an environment where material is constantly collecting, 
or to monitor an environment where uncertainty is continually 
growing. Each robot has only a small footprint over which to 
act (e.g. to sweep or to sense). The difficulty is in controlling 
the robots to move so that their footprints visit all points in the 
environment regularly, spending more time in those locations 
where the environment changes quickly, without neglecting 
the locations where it changes more slowly. This scenario is 
distinct from most other sweeping and monitoring scenarios 
in the literature because the task cannot be "completed." 
That is to say, the robots must continually move to satisfy 
the objective. We consider the situation in which robots are 
constrained to move on fixed paths, along which we must 
control their speed. We consider both the single robot and 
multi-robot cases. Figure [T] shows three robots monitoring an 
environment using controllers designed with our method. 

We model the changing environment with a scalar valued 
function defined, which we call the accumulation function. 
The function behaves analogously to dust accumulating over 
a floor. When a robot's footprint is not over a point in the 
environment, the accumulation function grows at that point at 
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Fig. 1: A persistent monitoring task using three robots with heterogeneous, 
limited range sensors. The surface shows the accumulation function, indicating 
the quantity of material to be removed in a cleaning application, or the 
uncertainty at each point in a sensing application. The accumulation function 
grows when a robot's footprint is not over it, and decreases when the footprint 
is over it. Each robot controls its speed along its prescribed path so as to keep 
the surface as low as possible everywhere. 



a constant rate, as if the point were collecting dust. When a 
robot's footprint is over the point, the accumulation function 
decreases at a constant rate, as if the dust were being vacuumed 
by the robot. The rates of growth and decrease can be different 
at different points in the environment. 

This model is applicable to situations in which the task 
of the robot is to remove material that is collecting in 
the environment, for example cleaning up oil in the ocean 
around a leaking well (T], vacuuming dirt from the floor 
of a building [2|, or tending to produce in an agricultural 
field [3). It is also applicable to monitoring scenarios in which 
the state of the environment changes at each point, and we 
want to maintain up-to-date knowledge of this changing state. 
Some examples include monitoring temperature, salinity, or 
chlorophyll in the ocean J4), maintaining estimates of coral 
reef health Q, or monitoring traffic congestion over a city |6|. 
These applications all share the property that they can never 
be completed, because the environment is always changing. If 
the robot were to stop moving, the oil would collect beyond 
acceptable levels, or the knowledge of the ocean temperature 
would become unacceptably outdated. For this reason we call 
these applications persistent tasks. In this paper, we assume 
that the model is known either from the physics of the 
environment, from a human expert, or from an initial survey 
of the environment. However we show analytically and in 
simulation that our controllers are robust to model errors. 
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We focus on the situation in which the robots are given 
fixed, closed paths on which to travel, and we have to carry 
out the persistent task only by regulating their speed. This is 
relevant in many real- world scenarios. For example, in the case 
that the robots are autonomous cars, they may be constrained 
to move on a particular circuit of roads in a city to avoid 
traffic congestion, or in the case of autonomous aircraft, they 
may be constrained to fly along a particular path to stay away 
from commercial air traffic, or to avoid being detected by 
an adversary. Even in the case of vacuuming the floor of a 
building, the robots may be required to stay along a prescribed 
path so as not to interfere with foot traffic. Furthermore, if we 
have the freedom to plan the robot's path, we can employ an 
off-line planner to generate paths that are optimal according 
to some metric (using the planner in [7] for example), then 
apply the procedure in this paper to control the speed along 
the path. Decoupling the path planning from the speed control 
in this way is a well-established technique for dealing with 
complex trajectory planning problems JS). 

A. Contributions 

Our approach to the problem is to represent the space of all 
possible speed controllers with a finite set of basis functions, 
where each possible speed controller is a linear combination of 
those basis functions. A rich class of controllers can be repre- 
sented in this way. Using this representation as our foundation, 
the main contributions of this paper are the following. 

(i) We formally introduce the idea of persistent tasks for 
robots in changing environments, and propose a tractable 
model for designing robot controllers. 

(ii) Considering the space of speed controllers parametrized 
by a finite set of basis functions, we formulate a linear 
program (LP) whose solution is a speed controller which 
guarantees that the accumulation function eventually 
falls below, and remains below, a known bound every- 
where. If the LP is infeasible, then no controller in the 
space that will keep the accumulation function bounded. 

(iii) We formulate an LP whose solution is the optimal speed 
controller — that which eventually minimizes the maxi- 
mum of the accumulation function over all locations. 

(iv) We generalize to the multi-robot case. We find an LP 
whose solution is a controller which guarantees that 
the accumulation function falls below, and stays below, 
a known bound. The multi-robot controller does not 
require communication between robots. 

We do not find the optimal controller for the multi-robot case, 
however, as it appears that this controller cannot be found as 
the solution to an LP. We demonstrate the performance of the 
controllers in numerical simulations, and show that they are 
robust to stochastic and deterministic errors in the environment 
model, and to unmodeled robot vehicle dynamics. 

It is desirable to cast our speed control problem as an 
LP since LPs can be solved efficiently with off-the-shelf 
solvers Q. This is enabled by our basis function represen- 
tation. The use of basis functions is a common, and powerful 
method for function approximation ifTUl . and is frequently 
used in areas such as compressive sampling [11], adaptive 



control 0121 . and machine learning lfl"3ll . Our LP formulations 
also incorporate both maximum and minimum limits on the 
robot's speed, which can be different at different points on the 
path. This is important because we may want a robot not to 
exceed a certain speed around a sharp turn, for example, while 
it can go much faster on a long straightaway. 

B. Related Work 

Our work is related to the large body of existing research 
on environmental monitoring, sensor sweep coverage, lawn 
mowing and milling, and patrolling. In the environmental 
monitoring literature (also called objective analysis in mete- 
orological research lfl4ll and Rriging in geological research 
ifTSIl ). authors often use a probabilistic model of the environ- 
ment, and estimate the state of that model using a Kalman- 
like filter. Then robots are controlled so as to maximize a 
metric on the quality of the state estimate. For example, the 
work in |[T6l controls robots to move in the direction of the 
gradient of mutual information. More recently, the authors of 
ifTTll and lfl8ll control vehicles to decrease the variance of their 
estimate of the state of the environment. This is accomplished 
in a distributed way by using average consensus estimators 
to propagate information among the robots. Similarly, |[T9ll 
proposes a gradient based controller to decrease the variance 
of the estimate error in a time changing environment. In GUI 
sensing robots are coordinated to move in a formation along 
level curves of the environmental field. In ETI the authors find 
optimal trajectories over a finite horizon if the environmental 
model satisfies a certain spatial separation property. Also, in 
ll22ll the authors solve a dynamic program (DP) over a finite 
horizon to find the trajectory of a robot to minimize the 
variance of the estimate of the environment, and a similar 
DP approach was employed in ll23l over short time horizons. 
Many other works exist in this vein. 

Although these works are well-motivated by the uncon- 
tested successes of Kalman filtering and Rriging in real- 
world estimation applications, they suffer from the fact that 
planning optimal trajectories under these models requires the 
solution of an intractable dynamic program, even for a static 
environment. One must resort to myopic methods, such as 
gradient descent (as in HH, flj], (TBI, ED, ED), or solve 
the DP approximately over a finite time horizon (as in [21 1, 
l22l . [23 1). Although these methods have great appeal from 
an estimation point of view, little can be proved about the 
comparative performance of the control strategies employed in 
these works. The approach we take in this paper circumvents 
the question of estimation by formulating a new model of 
growing uncertainty in the environment. Under this model, 
we can solve the speed planning problem over infinite time, 
while maintaining guaranteed levels of uncertainty in a time- 
changing environment. Thus we have used a less sophisticated 
environment model in order to obtain stronger results on the 
control strategy. Because our model is based on the analogy 
of dust collecting in an environment, we also solve infinite 
horizon sweeping problems with the same method. 

Our problem in this paper is also related to sweep coverage, 
or lawn mowing and milling problems, in which robots with 
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finite sensor footprints move over an environment so that every 
point in the environment is visited at least once by a robot. 
Lawn mowing and milling has been treated in [24] and other 
works. Sweep coverage has recently been studied in |25|, 
and in ll26l efficient sweep coverage algorithms are proposed 
for ant robots. A survey of sweep coverage is given in [27 1. 
Our problem is significantly different from these because our 
environment is dynamic, thereby requiring continual re-milling 
or re-sweeping. A different notion of persistent surveillance 
has been considered in [28 1 and |29|, where a persistent task 
is defined as one whose completion takes much longer than the 
life of a robot. While the terminology is similar, our problem is 
more concerned with the task (sweeping or monitoring) itself 
than with power requirements of individual robots. 

A problem more closely related to ours is that of pa- 
trolling 113011 . OTl . where an environment must be continually 
surveyed by a group of robots such that each point is visited 
with equal frequency. Similarly, in P2ll vehicles must repeat- 
edly visit the cells of a gridded environment. Also, continual 
perimeter patrolling is addressed in 11331 . In another related 
work, a region is persistently covered in [34] by controlling 
robots to move at constant speed along predefined paths. Our 
work is different from these, however, in that we treat the sit- 
uation in which different parts of the environment may require 
different levels of attention. This is a significant difference as it 
induces a difficult resource trade-off problem as one typically 
finds in queuing theory |35|, or dynamic vehicle routing [36|, 
[37 1 . In ll38l , the authors consider unequal frequency of visits 
in a gridded environment, but they control the robots using a 
greedy method that does not have performance guarantees. 

Indeed, our problem can be seen as a dynamic vehicle 
routing problem with some unique features. Most importantly, 
we control the speed of the robots along a pre-planned path, 
whereas the typical dynamic vehicle routing problem considers 
planning a path for vehicles that move at constant speed. Also, 
in our case all the points under a robot's footprint are serviced 
simultaneously, whereas typically robots service one point at a 
time in dynamic vehicle routing. Finally, in our case servicing 
a point takes an amount of time proportional to the size of the 
accumulation function at that point, whereas the service time 
of points in dynamic vehicle routing is typically independent 
of that points wait-time. 

The paper is organized as follows. In Section [TT] we set up 
the problem and define some basic notions. In Section III two 



curve 7 : [0,1] -> M 2 , where 7(0) = 7(1). (See Figure [2] 
for an illustration.) The curve is parametrized by 9 6 [0, 1], 
and we assume without loss of generality that 9 is the arc- 
length parametrization. The environment also contains a single 



LPs are formulated, one of which gives a stabilizing controller, 
and the other one an optimal controller. Multiple robots are 



addressed in Section IV The performance and robustness of 
the controllers are illustrated in simulations in Section |V] 



Finally, Section VI gives conclusions and extensions. 



II. Problem Formulation and Stability Notions 

In this section we formalize persistent tasks, introduce the 
notion of a field stabilizing controller, and provide a necessary 
and sufficient conditions for field stability. 

A. Persistent Tasks 

Consider a compact environment £ C M. 2 , and a finite set of 
points of interest Q C £. The environment contains a closed 



robot (we will generalize to multiple robots in Section IV 1 
whose motion is constrained along the path 7. The robot's 
position at a time t can be described by 9(t), its position 
along the curve 7. The robot is equipped with a sensor with 
finite footprint B(9) C £ (For example, the footprint could 
be a disk of radius r centered at the robot's position)^] Our 
objective is to control the speed v of the robot along the curve. 
We assume that for each point 9 on the curve, the maximum 
possible robot speed is v max (9) and the minimum robot speed 
is v m i n (9) > 0. This allows us to express constraints on the 
robot speed at different points on the curve. For example, for 
safety considerations, the robot may be required to move more 
slowly in certain areas of the environment, or on high curved 
sections of the path. To summarize, the robot is described by 
the triple TZ := (B, « min , u max ). 

A time-varying field Z : Q x M> — > M> , which we call 
the accumulation function, is defined on the points of interest 
Q. This field may describe a physical quantity, such as the 
amount of oil on the surface of a body of water. Alternatively, 
the field may describe the robot's uncertainty about the state 
of each point of interest. We assume that at each point q G Q, 
the field Z increases (or is produced) at a constant rate p(q). 
When the robot footprint is covering q, it consumes Z at a 
constant rate c(q), so that when a point q is covered, the net 
rate of decrease is p(q) ~ c(q). Thus, Z evolves according 
to the following differential equation (with initial conditions 
Z(q,0) and 0(0)): 

fp(q), if q 

Z(V, t) = I p(q) - c(q), if q £ B(9(t)) and Z(q, t) > 0, 
[0, if qeB(9(t)) and Z(q,t) = 0, 

(1) 

where for each q € Q, we have c(q) > p(q) > 0. In this 
paper, we assume that we know the model parameters p(q) 
and c(q). It is reasonable to assume knowledge of c(q) since 
it pertains to the performance of the robot. For example, in 
an oil cleanup application, the consumption rate of oil of the 
robot can be measured in a laboratory environment or in field 
trials prior to control design. As for the production rate, p(q), 
this must be estimated from the physics of the environment, 
from a human expert (e.g. an oil mining engineer in the case 
of an oil well leak), or it can be measured in a preliminary 
survey of the environment. However, the accuracy of the model 
is not crucial, as we show analytically and in simulations 
that our method has strong robustness with respect to errors 
in p(q). The production function can also be estimated and 
incorporated into the controller on-line, though we save the 
development of an on-line strategy for future work. 

With this notation, we can now formally define a persistent 
task. 

'For a ground robot or surface vessel, the footprint may be the robot's field 
of view, or its cleaning surface. For a UAV flying at constant altitude over 
a 2D environment, the footprint could give the portion of the environment 
viewable by the robot. 
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Fig. 2: An illustration of a curve 7 followed by the robot. The robot is located 
at 8 and has footprint 13(8). The set -F(q) of robot positions 8 for which the 
footprint covers q are shown as thick grey segments of the curve. 



Definition II.l (Persistent Tasks). A persistent task is a tuple 
(1Z, 7, Q,p, c), where 1Z is the robot model, 7 is the curve 
followed by the robot, Q is the set of points of interest, and p 
and c are the production and consumption rates of the field, 
respectively. 

In general, for a given persistent task, the commanded 
speed v could depend on the current position 9, the field 
Z, the initial conditions 6(0) and Z(q, 0), and time t. Thus, 
defining the set of initial conditions as IC := (0(0), Z(q, 0)), a 
general controller has the form v(9, Z,lC,t). For the reader's 
convenience, the notation used in this paper is summarized in 
Table H] 



B. Field Stability and Feasibility 

In this section we formalize the problem of stabilizing the 
field in a persistent task. As a first consideration, a suitable 
controller should keep the field bounded everywhere, inde- 
pendent of the initial conditions. This motivates the following 
definition of stability. 

Definition II.2 (Field Stabilizing Controller). A speed con- 
troller field stabilizes a persistent task if the field is always 
eventually bounded, independent of initial conditions. That is, 
if there exists a Z max < +00 such that for every q £ Q and 
initial condition Z((\, 0) and 9(0), we have 

limsupZ(q,<) < Z max . 

t— >+oo 

Note that in this definition of stability, for every initial 
condition, the field eventually enters the interval [0, Z max \. 

There are some persistent tasks for which no controller is 
field stabilizing. This motivates the notion of feasibility. 

Definition II.3 (Feasible Persistent Task). A persistent task is 
feasible if there exists a field stabilizing speed controller. 

As stated above, for a given persistent task, a general speed 
controller can be written as v(0, Z, IC, t). However, in the 
remainder of the paper we will focus on a small subset of 
speed controllers which we call periodic position-feedback 
controllers. In these controllers, the speed only depends on the 
robot's current position 9 e [0, 1]. The controllers are periodic 



in the sense that the speed at a point 9 is the same on each 
cycle of the path. The controller can be written as 



[0,1] 



where each 9 £ [0, 1] is mapped to a speed v(9) satisfying the 
bounds v min (9) < v(9) < V max (0). These controllers have the 
advantage that they do not require information on the current 
state of the field Z, only its model parameters p(q) and c(q). 
While it may seem restrictive to limit our controllers to this 
special form, the following result shows that it is not. 

Proposition II.4 (Periodic Position-Feedback Controllers). If 

a persistent task can be stabilized by a general controller 
v(6, Z, IC, t), then it can be stabilized by a periodic position- 
feedback controller v(9). 



The proof of Proposition II. 4 is given in Appendix [A] and 
relies on the statement and proof of the upcoming result in 
Lemma |II.5| Therefore, we encourage the reader to postpone 



reading the proof until after Lemma II.5 



We will now investigate conditions for a controller to be 
field stabilizing and for a persistent task to be feasible. Let us 
define a function which maps each point q £ Q, to the curve 
positions 9 for which q is covered by the robot footprint. To 
this end, we define 

F(q) :={0e[O,l] |qe£(0)}. 

An illustration of the curve, the robot footprint, and the set 
F(q) is shown in Figure [2] (In Section [V] we discuss how this 
set can be computed in practice.) 

Given a controller 9 i-> v(9), we define two quantities: 1) 
the cycle time, or period, T, and 2) the coverage time per 
cycle r(q). Since v(9) > for all 9, the robot completes one 
full cycle of the closed curve in time 



T 



v(9) 



d6. 



(2) 



During each cycle, the robot's footprint is covering the point 
q only when 9(t) £ F(q). Thus the point q is covered for 



r(q) 



1 



-de. 



(3) 



'F(q) V(9) 

time units during each complete cycle. 

With these definitions we can give a necessary and sufficient 
condition for a controller to stabilize a persistent task. In 



Section III we develop a method for testing if this condition 
can be satisfied by a speed controller. 

Lemma II.5 (Stability condition). Given a persistent task, a 
controller v(9) is field stabilizing if and only if 



c(q) 



1 



F(q) 



v(9) 



d9 > p(q) 



1 



v(9) 



de 



(4) 



for every q £ Q. Applying the definitions in |2| and (IS}, the 
condition can be expressed as i~(q) > (p(q)/c(q))T. 

The lemma has a simple intuition. For stability, the field 
consumption per cycle must exceed the field production per 
cycle, for each point q £ Q. We now prove the result. 
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Proof: Consider a point q e Q, and the set of curve 
positions F(q) for which q is covered by the robot footprint. 
Given the speed controller v, we can compute the cycle time 
T in Q. Then, let us consider the change in the field from 
Z(q, t) to Z(q, t + T), where t > 0. 

Define an indicator function I : [0,1] x Q -> {0, 1} as 
1(9, q) = 1 for 9 G F(q) and otherwise. Then, from ([TJ we 
have that 

Z(q,t)>p(q)-c(q)I(9(t),q), 

for all values of Z(q, i), with equality if Z(q,t) > 0. 
Integrating the above expression over [t, i + T] we see that 

Z(q, t + T) - Z(q, i) > p(q)T - c(q) / + l(0(r), q)rfr 

= p(q)T-c(q)r(q), (5) 

where r(q) is defined in |3j, and the equality follows from the 
fact that I(6(t), q) is simply an indicator function on whether 
or not the footprint is covering q at a given time. From |5]) 
we see that for the field to be eventually bounded by some 
Z max for all initial conditions Z(q, 0), we require that r(q) > 
p(q)/c(q)T for all q £ Q. 

To see that the condition is also sufficient, suppose that 
T (q) > Pio) / c(q)T . Then there exists e > such that 
p(q)T - c(q)r(q) = -e. If Z(q,t) > (c(q) - p(q))T, then 
the field at the point q g Q is strictly positive over the entire 
interval [t,t + T], implying that 

Z(q,t + T) = Z(q,t)-e. 

Thus, from every initial condition, Z(q, t) moves below 
(c(q) —p(qj)T. Additionally, note that for each t in the inter- 
val [t, t+T], we trivially have that Z(q, i) < Z{q, i) +p(q)T. 
Thus, we have that there exists a finite time i such that for all 

t > i, 

Z(q,t) < (c(q)-p(q))T + p(q)T = c(q)T. 

Since Q is finite, there exists a single e > such that for 
every point qe Q we have r(q) — p(q)/c(q)T > e. Hence, 
letting Z max = maXqgQ c(q)T, we see that Z is stable for all 
q, completing the proof. ■ 
In the following sections we will address two problems, 
determining a field stabilizing controller, and determining a 
minimizing controller, defined as follows: 

Problem II.6 (Persistent Task Metrics). Given a persis- 
tent task, determine a periodic position-feedback controller 
v : [0,1] — > M>o that satisfies the speed constraints (i.e., 
v(9) € [v min (9),v max (9)] for all 9 e [0,1];, and 

(i) is field stabilizing; or 

(ii) minimizes the maximum steady-state field T-L(v): 



T-L(v) :— max limsup Z(q, t) 
qe<2 \ t^+oo 

In Section [ill] we will show that by writing the speed 
controller in terms of a set of basis functions, problems (i) 
and (ii) can be solved using linear programs. In Section IV 
we will solve problem (i) for multiple robots. 



III. Single Robot Speed Controllers: 
Stability and Optimality 

In this paper we focus on a finite set of points of interest 
Q = {qi,...,q m }. These m locations could be specific 
regions of interest, or they could be a discrete approximation 
of the continuous space obtained by, for example, laying a 
grid down on the environment. In the Section [V] we will show 
examples of both scenarios. Our two main results are given in 



Theorems III. 1 and III. 8 which show that a field stabilizing 



controller, and a controller minimizing "H(w), can each be 
found by solving an appropriate linear program. 

To begin, it will be more convenient to consider the recipro- 
cal speed controller w _1 (6') := l/v(9), with its corresponding 
constraints 



1 



i 



Now, our approach is to consider a finite set of basis functions 
{/3i(#), . . . ,f3 n (9)}. Example basis functions include (a finite 
subset of) the Fourier basis or Gaussian basis |10|. In what 
follows we will use rectangular functions as the basis: 



1, if 9e [U-l)/n,j/n) 
0, otherwise, 



(6) 



for each j € {1, . . . , n}. This basis, which provides a piece- 
wise constant approximation to a curve, has the advantage that 
we will easily be able to incorporate the speed constraints 
^min(^) and v ma , x (9) into the controller. 

Then let us consider reciprocal speed controllers of the form 



(7) 



where eci, . . . , a n £ K are free parameters that we will use 
to optimize the speed controller. A rich class of functions 
can be represented as a finite linear combination of basis 
functions, though not all functions can be represented this 
way. Limiting our speed controller to a linear parametrization 
allows us to find an optimal controller within that class, while 
preserving enough generality to give complex solutions that 
would be difficult to find in an ad hoc manner. In the following 
subsection we will consider the problem of synthesizing a field 
stabilizing controller. 

A. Synthesis of a Field Stabilizing Controller 

In this section we will show that a field stabilizing speed 
controller of the form (|7]i can be found through the solution of 
a linear program. This result is summarized in Theorem |III. 1 1 
We remind the reader that A summary of the mathematical 
symbols and their definitions is shown in Table [I] 

To begin, let us consider reciprocal speed controllers in 
the form of ((TJ. Then for q^ 6 Q, the stability condition in 
Lemma 111. 5 1 becomes 



zZ a 3 

j = l 



Pj(6)d0 > 



p(<h) 
c(qi) 



/3j{9)d9 
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TABLE I: Table of Symbols 



Symbol Definition 



£ 



7 
v(9) 
TZ 
B{ff) 
F(cO 
Z(q,t) 

p(q) 

c(q) 

a 
T 
r(q) 



the environment. 

a set of points of interest in the environment £ . 
a point of interest in Q. 

a curve followed by the robot, and parametrized by 6. 
the speed of the robot along the curve 7. 

the robot model, consisting of minimum and maximum speeds t> m m(#) and i> ma x(#), and a robot footprint B(8). 
The set of points in the environment covered by the robot's footprint when positioned at 6. 
the values of 6 £ [0, 1] along the curve 7 for which the point q is covered by the footprint, 
the field (or accumulation function) at point q £ Q at time t. 
the rate of production of the field Z at point q. 

the rate of consumption of the field Z when q is covered by the robot footprint, 
a basis function. 

a basis function coefficient, or parameter. 

the time to complete one full cycle of the curve. 

the amount of time the point q £ Q is covered per cycle. 

cost function j!0|> giving the steady-state maximum value of Z(q, t) 



Rearranging, we get 

n 

^ctjKfaPj) > 0, 
where we have defined 



F(q;) CyQl) JO 



P j {e)d6. (8) 



Finally, to satisfy the speed constraints we have that 

1 " 1 

<X>ift(<0<- 



(0) 



(9) 



For the rectangular basis in ([6|, the speed constraints become 
1 1 



<W Wmin(j) 



where 



^max(j) = inf V n 

eel(j-l)/n,j/n) 

Wmin(j) = SUp V n 

ee[(j-l)/n,j/n) 

Thus, we obtain the following result. 



K (6), and 



Theorem III.l (Existence of a Field Stabilizing Controller). 

A persistent task is stabilizable by a speed controller of the 
form |7]l if and only if the following linear program is feasible: 

minimize 

n 

subject to ajK(<\i, j3j) > V i € {1, . . . , m} 

v m- aj -^' Vj€{l,...,n}, 

where i^(q.j,/3j) is defined in ([8]), and ai, . . . ,a n are the 
optimization variables. 

Hence, we can solve for a field stabilizing controller using 
a simple linear program. The program has n variables (one for 
each basis function coefficient), and 2n + m constraints (two 
for each basis function coefficient, and one for each point 
of interest in Q). One can easily solve linear programs with 



thousands of variables and constraints [9|. Thus, the problem 
of computing a field stabilizing controller can be solved 
for finely discretized environments with thousands of basis 
functions. Note that in the above lemma, we are only checking 
feasibility, and thus the cost function in the optimization is 
arbitrary. For simplicity we write the cost as 0. 



In Theorem III.l the cost is set to to highlight the 
feasibility constraints. However, in practice, an important 
consideration is robustness of the controller to uncertainty 
and error in the model of the field evolution, and in the 
motion of the robot. Robustness of this type can be achieved 
by slightly altering the above optimization to maximize the 
stability margin. This is outlined in the following result. 

Corollary III.2 (Robustness via Maximum Stability Margin). 

The optimization 

maximize B 

n 

Y^ctiKfafa) >B Vt€{l,...,m} 
v M ~ aj - VT7j)> Vj€{l,...,n}, 

^max \J ) u min \J ) 

where a.\, . . . , a n and B are the optimization variables, yields 
a speed controller which maximizes the stability margin, 
min qiS Q yij—i a jK(c^i, j5j). The controller 

(i) has the largest decrease in Z{(\i,t) per cycle, and thus 
achieves steady-state in the minimum number of cycles. 

(ii) is robust to errors in estimating the field production rate. 
If the robot's estimate of the production rate at a field 
point (\i £ Q is p{<\i), and the true value is p(qi) < 
J?(qi) + e, then the field is stable provided that for each 
q* G Q 



e < B ■ c(qi) 




f3 3 (0)d9 



Proof: The first property follows directly from the fact 
that Y?j=i a j^( c lij Pj) i s me amount of decrease of the field 
at point q; in one cycle. 
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To see the second property, suppose that the optimization is 
based on a production rate p(qi) for each £ Q, but the true 
production rate is given by p(qi) := p(q») + e, where e > 0. 
In solving the optimization above, we obtain a controller with 
a stability margin of B > 0. For the true system to be stable 
we require that for each q; G Q, 



3=1 



a ; 



Pj(6)d0 > 



p(q» 



This can be rewritten as 



c(q* 



l3 J (0)d0. 



3=1 



from which we see that the true field is stable provided that 
e < Bc(cu)/( ELi "3 Jo ft W^) for each * e Q. ■ 



Remark III.3 (Alternative Basis Functions). Given an ar- 
bitrary set of basis functions, the inequalities in (|9]i yield 
an infinite (and uncountable) number of constraints, one for 
each £ [0, 1]. This means that for some basis functions the 



linear program in Theorem lll.l can not be solved exactly. 
However, in practice, an effective method is to simply en- 
force the constraints in (|9]l for a finite number of values, 
0i, ... , W . Each 0i generates two constraints in the optimiza- 
tion. Then, we can tighten each constraint by £ > 0, yielding 



£ for each 



i £ {1, . . . , w}. Choosing the number of values w as large 
as possible given the available computational resources, we 
can then increase £ until the controller satisfies the original 
speed constraints. □ 

B. Synthesis of an Optimal Controller 

In this section we look at Problem |II.6| (ii), which is to 
minimize the maximum value attained by the field over the 
finite region of interest Q. That is, for a given persistent task, 
our goal is to minimize the following cost function, 



1-L(v) = max limsup Z(q, t) 
qe<2 V t->+oo 



(10) 



over all possible speed controllers v. At times we will refer to 
the maximum steady-state value for a point q using a speed 
controller v as 

H(q, v) := limsup Z(q, t) 

t— >+oo 



Our main result of this section, Theorem III. 8 is that %{v) 



can be minimized through a linear program. However, we must 
first establish intermediate results. First we show that if v is 
a field stabilizing controller, then for every initial condition 
there exists a finite time t* such that Z(q, t) < H(v) for all 
t > t*. 

Proposition III.4 (Steady-State Field). Consider a feasible 
persistent task and a field stabilizing speed controller. Then, 
there is a steady-state field 

Z:Qx [0,1] ->M>o, 

satisfying the following statements for each q £ Q: 



(i) for every set of initial conditions 0(0) and Z(q, 0), there 
exists a time t* > such that 

Z(q,t) = Z(q,0(t)), 



for all t>t*. 
(ii) there exists at least one £ [0, 1] such that Z(q, I 



0. 



From the above result we see that from every initial condi- 
tion, the field converges in finite time to a steady-state Z(q, 0). 
In steady-state, the field Z(q, t) at time t depends only on 0(t) 
(and is independent of Z(q, 0)). Each time the robot is located 
at 9, the field is given by Z(q, 0). Moreover, the result tells us 
that in steady-state there is always a robot position at which 
the field is reduced to zero. In order to prove Proposition |III.4| 
we begin with the following lemma. Recall that the cycle-time 
for a speed controller v is T := J Q \/v(9)d9. 

Lemma III.5 (Field Reduced to Zero). Consider a feasible 
persistent task and a field stabilizing speed controller. For 
every q £ Q and every set of initial conditions Z(q, 0) and 
9(0), there exists a time t* > T such that 



Z(q,t* +aT) = 0, 



(11) 



for all non-negative integers a. 



Proof: Consider any q £ Q, and initial conditions 
Z(q, 0) and 9(0), and suppose by way of contradiction that 
the speed controller is stable but Z(q, t) > for all t > 
T. From Lemma |II.5| if the persistent task is stable, then 
c(q)r(q) > p(q)T for all q. Thus, there exists e > such 
that c(q)r(q) — p(q)T > e for all q£ Q. From the proof of 
Lemma 111.51 we have that 



Z(q, t + T)- Z(q, t) = p(q)T - c(q)r(q) = -e. 

Therefore, given Z(q, 0), we have that Z(q, t*) — for some 
finite t* > T, a contradiction. 

Next we will verify that if Z(q,t*) = for some t* > T, 
then Z(q, t* + T) = 0. To see this, note that the differential 
equation ([T| is piecewise constant. Given a speed controller 
v(9), the differential equation is time-invariant, and admits 
unique solutions. 

Based on this, consider two initial conditions for ([TJ, 

Z x {q, 0) := Z(q, t*-T)> 0, ^(0) := 0(t* - T) = 0(t*), 



and 



Z 2 (q, 0) := Z(q, t*) = 0, 9 2 (0) := 9(t*). 



Since ([T| is time-invariant, we have that Z\ (q, T) = 
Z(q,t*) = 0, and Z 2 (q,T) = Z(q,t* + T). In addition, 
by uniqueness of solutions, we also know that Z\{q, 0) > 
Z 2 (q,0) implies that Z x (q,T) > Z 2 (q,T). Thus, we have 
that Z(q,t*) = > Z(q,t* + T), proving the desired result. 

■ 

The previous lemma shows that from every initial condition 
there exists a finite time t* , after which the field at a point 
q is reduced to zero in each cycle. With this lemma we can 



prove Proposition III .4 



Proof of Proposition III. 4 In Lemma M.5| we have 



shown that for every set of initial conditions Z(q, 0), 9(0), 
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there exists at time t* > T such that Z(q,t* + aT) = for 
all non-negative integers a. Since T is the cycle-time for the 
robot, we also know that 6(t*+aT) = 0(t*) for all a. Since {j} 
yields unique solutions, (JTTJ uniquely defines Z(q. t) for all 
t>t*, with 

Z(q,t + T) = Z(q,t) for all t > t*. 

Hence, we can define the steady-state profile Z(q,9) as 

Z(q,9(t)) := Z(q,t) for all t E [t*,t* +T). 

Finally, we need to verify that Z(q, 9) is independent of 
initial conditions. To proceed, suppose by way of contradiction 
that there are two sets of initial conditions 6*i(0), Z 1 (q, 0), 
and #2(0), Z 2 (q, 0) which yield different steady-state fields 
Zx(q,8) and Z 2 (q, 9), That is, there exists 9 such that 
Z%(q,d) 7^ Z 2 (q,9). Without loss of generality, assume that 
Z%(q, 0) > Z 2 (q,9). To obtain a contradiction, we begin 
by showing that this implies Zx(q, 9) > Z 2 (q,9) for all 
9. Note that Z x and Z 2 reach their steady-state profiles Z\ 
and Z 2 in finite time. Thus, there exist times t\,t 2 > for 
which 9i(ti)_= 9 2 (t 2 ) = 9, and Zi(q,h) = Zi(q,6), and 
Z 2 (q,t 2 ) = Z 2 (q,6). Since Z\(q, t\) > Z 2 (q,t 2 ), and since 
Z is a continuous function of time, either i) £>i(q, ti +t) > 
Z 2 (q, t 2 + 1) for all t > 0, or ii) there exists a time t > for 
which Z%(q, t\ +t) = Z 2 (q, t 2 + t), which by uniqueness of 
solutions implies Z\(q, t\ + 1) = Z 2 (q,t 2 + t) for all t > t . 
Thus,_Zi(q,ti +_t) > Z 2 (q,t 2 + t) for all t > 0, implying 
that Zi(q, 8) > Z 2 (q, 9) for all 9. 

From Lemma III. 5 there exists a 9 for which Z\{q, 6) = 
0. Since, Zx(q, 9) > Z 2 (q,9) for all 9, we must have that 
Z 2 (q,9) = 0. However, the value of Z\ and Z 2 at 9 uniquely 
defines Z\ and Z 2 for all 9, implying that Zi(q, 9) = Z 2 (q, 9), 
a contradiction. ■ 

From Proposition III.4| we have shown the existence of 
a steady-state field Z(q, 9) that is independent of initial 
conditions Z(q, 0) and 9(0). 

Now, consider a point q e Q and a field stabilizing speed 
controller v(9), and let us solve for its steady-state field 
Z(q,6). To begin, let us write F(q) (the set of 9 values for 
which the point q is covered by the footprint) as a union of 
disjoint intervals 



F(q) = [x u yi] U [22,2/2] U • • • U [x t ,\ 



(12) 



where I is a positive integer, and > Xk > Dk-i for each 
fee {1, . . . , £}j^]Thus, on the intervals [xk,yk] the point q is 
covered by the robot footprint, and on the intervals [yk,Xk+i], 
the point q in uncovered. As an example, in Figure [2j the set 
F(q) consists of two intervals, and thus £ — 2. An example 
of a speed controller and an example of a set F(q) are shown 



in Figures 3a and 3b 



2 Note that the number of intervals I, and the points X\,...,xi and 
yi, . . . ,yi are a function of q. However, for simplicity of notation, we will 



From differential equation ([T]l we can write 

f Xk d9 

Z(q, x k ) = Z(q, y k _-i) + p(q) / — — 
Z{q, y k ) = [Z(q, x k ) + (p(q) - c(q)) 



(13) 



yk d9 V 

k W)J 



where for z e R, we define (z) + := 
equations ( fT3] l and ( fT4] ) we see that 



Z(q, y k ) 



Z(q,Vk-i) +p(q) 



(14) 

max{z, 0}. Combining 

d9 



f Vk d9 



For each & 6 { 1, . . . 
W*-6,*(q) :=p(q) 



, £}, let us defin 
Note that we can write 



6-1 

(q)E 



d6» 



(15) 



(16) 



Z(q,y fc ) = (Z(q,tf fc _i) + JV fc _ lifc (cO)" 
and, from ( fl3| l we have 

Z{q, y k ) > Z(q, y fe _ b ) + N k - b , k {q)- 

Moreover, 



(17) 



Z{q,y k ) = Z(q,y k - b ) + N k - b , k {q), 

if Z{q,y k -j) > for all j £ {1, 



,6-1}. (18) 



Thus, we see that the quantity N k - bik (q) gives the maximum 
reduction in the field between 9 — y k -b and 9 = y k . An 



example for b = 1 is shown in Figure 3c With these definition, 
we can characterize the steady-state field at the points y k . 

Lemma III.6 (Steady-State Field at Points y k ). Given a 
feasible persistent task and a field stabilizing speed controller, 
consider a point q £ Q and the set F(q) = U k=1 \x kl yk\ 
Then, for each k G {1, ...,£} we have 



Z(q, y k ) 



max N k ^ bM (q), 
be{o,...,e-i} 



where N k - b>k (q) is defined in and N kjk (q) := 0. 

Proof: Let us fix fc G {1, ...,£}. Given a field stabilizing 



controller, Proposition III.4| tells us that there exists 9 such 



that Z(q, 9) = 0. It is clear that this must occur for some 
9 e F(q). Therefore, 

^(q, Vj) = for some j E {1, . . . , £}. 

Let b be the smallest non-negative integer such that 
Z(q,yk-b) = 0. By (TT8J we have 



omit writing the explicit dependence. 



Z(q,y k ) =N k - b , k (q) > 0. 

If b = 0, then the previous equation simply states that 
Z(q,y k ) = 0. Now, if N k _ d , k (q) < N k ^ k (q) for all 

3 In this definition, and in what follows, addition and subtraction in the 
indices is performed modulo I. Therefore, if k = 1, then A r j._ 1 k = 1, 
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(a) A sample reciprocal speed controller v(9) (b) The steady-state field Z[q, 9). The set F(q) (c) The calculation of quantities N\^2, ^2,3 and 

consists of three intervals which are shaded on the -/V34. The values represent the maximum reduction 

#-axis. The steady-state profile is increasing outside from yk-i to y k . Thus, ^^2,^34 < while 

of F(q) and decreasing inside F(q). -^2,3 > 0. 

Fig. 3: The steady-state field for a point q £ Q. The set of 8 values for which q is covered is given by F(q) = U [x2,V2\ U [2:3, j/3]. The field is 

produced at a rate p(q) = 3, and is consumed by the footprint at a rate c(q) = 8.5. 



d e {0, .. .,£}, then Z(q,y k ) = raa,x be{0 ^^ ie _ 1} N k - btk (q), 
and we have completed the proof. 

Suppose by way of contradiction that there is d E 
{0,...,£- 1} for which N k - d ,k(<l) > 7V fe _ b , fc (q). From ^7} 
we have 

Z(q,,y k ) > Z{q,y k _ d ) + N k _ dik (q) > N k _ d , k (q) 7 

where the second inequality comes from the fact that 
Z(q.,Vk-d) > 0. However Z(q,y k ) = N k _ bjk (q), implying 
that 7V fe _ difc (q) < N k - b<k (q), a contradiction. ■ 
The above lemma gives the value of the field in steady-state 
at each end point y k . The field decreases from x k to y k (since 
these are the 8 values over which the point q is covered), 
and then increases from y k to x k+ i. Therefore, the maximum 
steady-state value is attained at an end point x k for some k. 



For example, in Figure 3b the maximum is attained at the 
point x\. However, the value at x k can be easily computed 
from the value at y k -i using ( [13) : 

f Xk+1 d9 

Z(q,x k+1 ) = max iV k _ 6fc (q) + p(q) / -^r. 

be{o,...,£-i} J Vk v{9) 

From this we obtain the following result. 

Lemma III.7 (Steady-State Upper Bound). Given a field 
stabilizing speed controller v, the maximum steady-state field 
at q G Q (defined in jlO) ) satisfies 



H(q, v) = max 
ke{i,...,£} 

b£{0,... : l-l} 



^fc,b(q), 



where 



x kib (q) = p(q) 



Xk+l 



dO 

W) 



-c(q) 



b-l 

£ 

w =0 Jx k 



Vk- 



de 



and F(q) = U e k=1 [x k , y k ] with y k > x k > y k - X for each k. 

The above lemma provides a closed form expression (albeit 
quite complex) for the largest steady-state value of the field. 
Thus, consider speed controllers of the form 



-He) 



where f3i,...,/3 n are basis functions (e.g., the rectangular 
basis). For a finite field Q = {qi,...,q TO }, the terms 
^V/c-fc,fc(qi) can be written as 



X k ^ b (qi 



ajX kb (qi,f3j), 



where 

X k}b (<li,Pj) ■■= 



p(q) 01(8^9 - c(q] 

J Dk-b 



6-1 i-Vh-w 

/ Pj{0)M. (19) 



With these definitions we can define a linear program for 
minimizing the maximum of the steady-state field. We will 
write £(q) to denote the number of disjoint intervals on the 
curve 7 over which the point q is covered, as defined in ( fTZ) . 

Theorem III.8 (Minimizing the Steady-State Field). Given a 
feasible persistent task, the solution to the following linear 
program yields a speed controller v of the form Q that 
minimizes the maximum value of the steady-state field H(v). 

minimize B 

n 

subject to ctjXk,b(<li) Pj) < B Vie {1, . . . , m}, 

ke {l,... ,£{*)}, 
6€{0,...,i(q,)-l} 



J2»jK(q l ,P J )>0 

3=1 

1 1 

< a,- < 



Wmax(j) 



«min(j)' 



V i € {1, . . . , m} 



The optimization variables are OLj and B and the quantities 
X k . b (qi, Pj) and K(qi,pj) are defined in ( |19| l and dHJ. 

From the above theorem, we can minimize the maximum 
value of the field using a linear program. This optimization has 
n + 1 variables (n basis function coefficients, and one upper 
bound B). The number of constraints is m X)"=i £(<ii) 2 +m + 
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2n. In practice, t{qi) is small compared to n and m, and is 
independent of n and m. Thus, for most instances, the linear 
program has 0(2n + m) constraints. 

IV. Multi-Robot Speed Controller 

In this section we turn to the multi-robot case. We find 
that a field stabilizing controller can again be formulated as 
the solution of a linear program. Surprisingly, the resulting 
multi-robot controller does not rely on direct communication 
between the robots. We also show that the optimal controller 
(the one that minimizes the steady state field) for multiple 
robots cannot be formulated as an LP as in the single robot 
case. Finding the optimal multi-robot controller is a subject of 
ongoing work. 

The multiple robots travel on fixed paths, but those paths 
my be different (or they may be the same) and they may 
intersect arbitrarily with one another. The robots may have 
different consumption rates, footprints, and speed limits. We 
do not explicitly deal with collisions in this section, though the 
controller we propose can be augmented with existing collision 
avoidance strategies. This is the subject of on going work. 

We must first modify our notation to accommodate multiple 
robots. Consider N robots with closed paths -f r : [0, 1] — > M 2 , 
r e {1,...,N} where 7 r and j r i may intersect with each 
other arbitrarily (e.g., they may be the same path, share 
some segments, or be disjoint). We again assume that the 
parametrization of each curve is an arc-length parametriza- 
tion, normalized to unity. Robot r, which traverses path 
7 r at position 6 r (t), has a consumption rate c r (q) over a 
footprint B r (9 r (t)), and has a speed controller v r (9 r ) with 
maximum and minimum speed constraints v rmin (9 r ) and 
"r,max(^r)i respectively. Let lZ r — (B r ,v r , m i n ,v rmax ) be 
the tuple containing the parameters for robot r and redefine 
1Z := (TZi, . . . , TZn) to be the tuple containing all the robots' 
tuples of parameters. Also, the set of points 9 r from which 
q is in robot r's footprint is denoted F r (q). Furthermore, we 
let 7 :— (71, . . . ,7jv) an d c := (ci, . . . , cjy) be the tuple 
of all robot's paths and all robots' consumption functions, 
respectively. The persistent task with N robots is now written 
(1Z, 7, Q, p, c) as before, and we seek speed controllers v r (9 r ) 



to keep Z(q, t) bounded everywhere, as in Definition II.2 



We make the assumption that when multiple robots' foot- 
prints are over the same point q, their consumption rates are 
additive. Specifically, let A r q (t) be the set of robots whose 
footprints are over the point q at time t, 

■A/q(*) ~{r\qeB r (d r (t))}. 
Then the rate of change of the function Z(q, t) is given by 



Z(q,t) = 



P(q)~ E c -( q )' ifZ(q,t)>0, 

rGJVq(i) 

(p(q)- E c -(q)) + ' if^(q,t) = 0. 

reJVq(i) 



(20) 

We can reformulate a stability condition analogous to Lemma 



II. 5 to suit the multi-robot setting, but we must first establish 
the notion of a common period for the speed controllers of 



all the robots. Let T r = f Q v~ 1 (9)d9 be the period of robot 
r, and let r r (q) = j F ^ v~ 1 (9)d9 be the time in that period 
that q is in robot r's footprint. The existence of a common 
period rests on the following technical assumption. 

Assumption IV.l (Rational Periods). We assume that the 
periods T r are rational numbers, so that there exist integers 
num r and den r such that T r — num r / den r . 



An immediate consequence of Assumption IV. 1 is that there 
exists a common period T such that T/T r € N for all r. That 
is, each controller executes a whole number of cycles over 



the time interval T. Specifically, letting T = H^ =1 num r , we 
have T/T r = den r II^ =1 r , j. r nunv ■ Now, we can state the 
necessary and sufficient conditions for a field stabilizing multi- 
robot controller. 

Lemma IV.2 (Multi-Robot Stability Condition). Given a 
multi-robot persistent task, the set of controllers 9 r 1— > v r {9 r ), 
r G {!,..., N} is field stabilizing if and only if 



N 

E 



Tv(q) 



-(q) > p(q) 



(21) 



for every q € Q, where T r — f Q v r 1 (9)d9 and r r (q) = 

Ik „.<y ; 

The lemma states an intuitive extension of Lemma |II.5| 
which is that the total consumption per cycle must exceed 
the total production per cycle at each point q £ Q. 



Proof: The proof closely follows the proof of Lemma II. 5 



Consider the change of Z(q,t) at a point q over any time 
interval T, where T is a common period of all the rational 



periods T r , so that T/T r e N for all r. By integrating (20 1 
we have that 



Z(q,t + T)-Z(q,t) > Tp(q) 



N 

E 

r=l 



I r (r, q)c r (q)dT 



T/T r t+kTr 

Tp(q)- Y E / I,(r,q)c r (q)dr 

r=JV k=i Jt+(k-l)T r 



T 



(p(q) 



N 



I N T r(q) 

^c T {q)—- 

r=l r 



)• 



where l r (t, q) takes the value 1 when q is in the footprint of 
robot r and otherwise. To simplify notation, define C(q) := 

T,r=i ^(q) and ^(q) : = T,r=i cr(q)r r (q)/r r . 

First we prove necessity. In order to reach a contradiction, 
assume that the condition in Lemma |IV.2| is false, but the 
persistent task is stable. Then T(p(q) — C'(q)) > for some 
q, implying Z(q,t + T) > Z(q,t) for some q and for all t, 
which contradicts stability. In particular, for an initial condition 
Z(q, 0) > Z max , Z(q, Tk) > Z max for all k = 1, . . .. 

Now we prove sufficiency. If the condition is satisfied, 
there exists some e > such that T(p(q) — C*(q)) = — e. 
Suppose that at some time t and some point q, Z(q, t) > 
T(C(q) — p(q)) (if no such time and point exists, the 
persistent task is stable). Then for all times in the interval 
re [t,t + T], Z(q,t + r) > 0, and by (|20jl we have that 
Z(q,t + T) - Z(q,t) = T(p(q) - C(q)) = -e. Therefore, 
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after finitely many periods T, Z(q, t) will become less than 
T(C(q) — p(q)j. Now for a time t and a point q such that 
Z(q,t) < T(C(q) - p(q)), for all times r G [t,i + T] we 
have that Z(q,f + -r) < Z(q,t)+p(q)T < TC(q). Therefore, 
once Z(q,t) falls below T(C(q) - p(q)) (which will occur 
in finite time), it will never again exceed TC(q). Therefore 
the persistent task is stable with Z max = max qe Q TC(q). ■ 

Remark IV.3 (Justification of Rational Periods). Assumption 
\IV.l\ is required only for the sake of simplifying the exposition. 
The rational numbers are a dense subset of the real numbers, 
so for any e > we can find num r and den r such that 
mini,. / dcn r < T r < num r / den r +e. One could carry the e 
through the analysis to prove our results in general. □ 

A. Synthesis of Field Stabilizing Multi-Robot Controllers 

A field stabilizing controller for the multi-robot case can 
again be formulated as the solution of a linear program, 
provided that we parametrize the controller using a finite 
number of parameters. Our parametrization will be somewhat 
different than for the single robot case, however. Because 
there are multiple robots, each with its own period, we must 
normalize the speed controllers by their periods. We then 
represent the periods (actually, the inverse of the periods) as 
separate parameters to be optimized. In this way we maintain 
the independent periods of the robots while still writing the 
optimization over multiple controllers as a single set of linear 
constraints. 

Define a normalized speed controller v r (0 r ) '■= T r v r (9 r ), 
and the associated normalized coverage time f r (qi) := 
f F , , v~ 1 {6)d6 — r r {(\i)/T r . We parametrize u" 1 as 

n r 

<V 1 (6r) = ^2a r j0 rj (6 r ), 

3 = 1 

where n r is the number of basis functions for the rth robot, 
and a r j £ R and /3 r j '■ [0, 1] — > M>o are robot r's jth 
parameter and basis function, respectively. It is useful to 
allow robots to have a different number of basis functions, 
since they may have paths of different lengths with different 
speed limits. Assuming the basis functions are normalized with 
/„ P rj (9)d6 = 1 for all r and j, then we can enforce that v r 
is a normalized speed controller by requiring 



E< 

i=i 



1 Vr G {1,...,N}. 



As before, we could use any number of different basis func- 
tions, but here we specifically consider rectangular functions 
of the form (|6j. We also define the frequency for robot r as 
f r := 1/T r , and allow it to be a free parameter, so that 

fr 



,.) = f r v r (9 r ) = 



Si'=l a rjPrj(9r) 



(22) 



From pT| ), for the set of controllers to be field stabilizing, we 
require 



JV 

E 

r=l 



Cr(qi) ><< 



j=1 Jfa* 



for all r G {1, . . . , N} and q; G Q. Thus, defining 

K r {m,^) :=Cr(q() / Prj(0)d0, (23) 
the stability constraints become 

N n T 
r=l j=l 

To satisfy the speed constraints, we also require that 



«^i»^r) < VrHOr) < ^1^), which from j22| leads 
to 



^— <Y t a rj p rj {<B r ) < f - 



1 /'.max V r / 



(OrY 



for all 9 r G [0, 1]. For the rectangular basis functions in 
this specializes to f r /v r>lain (j) < a rj < f r Jv r>min (j) for all 
r and j, where 



^V,maxw) • VUl ^r,max(^r) 

ee[(]-l)/n r , 3 /n r ) ' 



and 



sup /' 

ee[(j-i)/n r ,j/n r ) 



r.min y^r 



This gives a linear set of constraints for stability, which allows 
us to state the following theorem. 

Theorem IV.4 (Field Stabilizing Multi-Robot Controller). A 

persistent task is stabilizable by a set of multi-robot speed 



controllers v r (6 r ), r G {1, ...,N}, of the form {22 \ if and 
only if the following linear program is feasible: 

minimize 

N n T 

subject to EE «rj-KV(qii Pj) > p(<h), 

r=l j=l 



E ar i = 1 

J = l 
fr >0 
fr 



W <°«< 



fr 



V r .i 



>(i) 



Vie{l,. 
Vr €{1, 

Vr G {1, 
Vje{l, 
rG{l,. 



. ,m} 
.,N} 

.,N} 

.,n r }, 

■,N}, 



where each a r j and f r is an optimization variable, and 
K r (qi,/3j) is defined in 



The above linear program has Y^r=i Ur ^ variables (one 
for each basis function coefficient a r j, and one for each 
frequency / r ), and m + Y^=i ^( n r + 1) constraints. Thus, 
if we use n basis functions for each of the N robot speed 
controllers, then the number of variables is (n+ 1)N and the 
number of constraints is m + 2N(n + l). Therefore, the size of 
the linear program grows linearly with the number of robots. 

Remark IV.5 (Maximizing Stability Margin). As summa- 



rized in Corollary III. 2 rather than using the trivial cost 



function of in the LP in Theorem IV.4 one may wish 
to optimize for a meaningful criterion. For example, the 
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controller that gives the minimum number of common periods 
to steady-state can be obtained by maximizing B subject to 

J2r=i E"=i ow-Kr(qi, Pj) - p(<lt) >£Vie{l,...,m}, 
in addition to the other constraints of Theorem \IV.4\ □ 

Remark IV.6 (Minimizing the Steady State Field). The reader 
will note that we do not find the speed controller that mini- 
mizes the steady state field for the multi-robot case, as was 
done for the single robot case. The reason is that the quantities 
called Nk^b k in the single robot case would depend on the 
relative positions of the robots in the multiple robot case. 
These quantities would have to be enumerated for all possible 
relative positions between the multiple robots and there are 
infinite such relative positions. Thus the problem cannot be 
posed as an LP in the same way as the single robot case. We 
are currently investigating alternative methods for finding the 
optimal multi-robot controller, such as convex optimization. .□ 

V. Simulations 

In this section we present simulation results for the single 
robot and multi-robot controllers. The purpose of this section 
is threefold: (i) to discuss details needed to implement the 
speed control optimizations, (ii) to demonstrate how con- 
trollers can be computed for both discrete and continuous 
fields, and (iii) to explore robustness to modeling errors, 
parameter uncertainty, robot tracking errors, and stochastic 
field evolution. 

The optimization framework was implemented in 
MATLAB®, and the linear programs were solved using 
the freely available SeDuMi (Self-Dual-Minimization) 
toolbox. To give the readers some feel for the efficiency of 
the approach, we report the time to solve each optimization 
on a laptop computer with a 2.66 GHz dual core processor 
and 4 GB of RAM. The simulations are performed by 
discretizing time, and thus converting the field evolution into 
a discrete-time evolution. To perform the optimization, we 
need a routine for computing the set F(q) for each field point 
q € Q. This is done as follows. We initialize a set F(q) for 
each point q, and discretize the robot path into a finite set 
{9i, . . . , 9 n }. For the rectangular basis, this disretization is 
naturally given by the set of basis functions. We iteratively 
place the robot footprint at each point 0j, oriented with the 
desired robot heading at that point on the curve, and then add 
8i to each set F(q) for which q is covered by the footprint. 
By approximating the robot footprint with a polygon, we can 
determine if a point lies in the footprint efficiently (this is a 
standard problem in computational geometry). 

Figure [4] shows a simulation for one ground robot perform- 
ing a persistent monitoring task of 10 points (i.e., \Q\ = 10). 
The environment is a 70 m by 70 m square, and the closed path 
has a length of 300 m. For all points we set the consumption 
rate c(q) = 1 (in units of (field height)/s). For each yellow 
point q we set p(q) = 0.15, and for the single red point we 
set p(q) = 0.35. The robot has a circular footprint with a 
radius of 12 m, and for all 9 the robot has a minimum speed 
of I'min = 0.2 m/s and a maximum speed of u max = 2 m/s. 
If the robot were to simply move at constant speed along the 
path, then 8 of the 10 field points would be unstable. The speed 






Fig. 4: An example with a field consisting of 10 points. The field Z(q,t) 
at each point is indicated by the area of the disk centered at the point. The 
vehicle footprint is a disk. The time sequence of the three snapshots goes from 
left to right. The vehicle is moving counter-clockwise around the top half of 
the figure eight and clockwise around the bottom half. The time evolution of 
the red field point is shown in Figure [5] 



iJ 




500 1000 
time 



(a) The optimal speed controller, (b) The field Z(q, t) at the red (dark) 

point. 

Fig. 5: The optimal speed controller corresponding to the curve and field 
shown in Figure [4] The minimum speed is i) m i n = 0.22 m/s and the 
maximum speed is v m3x = 2 m/s. The field Z(q, t) is shown for the red 
(dark) point in Figure [4] It converges in finite time to a steady state. 



controller resulting from the optimization in Section |III-B| is 



shown in Figure 5a A total of 150 rectangular basis functions 
were used. The optimization was solved in less than 1/10 
of a second. Using the speed controller, the cycle time was 
T = 420 s. 

The field Z(q,t) for the red (shaded) point in Figure [4] is 
shown as a function of time in Figure [5b] One can see that the 
field converges in finite time to a periodic cycle. In addition, 
the field goes to zero during each cycle. The periodic cycle is 



the steady-state as characterized in Section III-B 



In Figure |6| a simulation is shown for the case when the en- 
tire continuous environment must be monitored by two aerial 
robots. For multiple robots we synthesized a field stabilizing 
controller by maximizing the stability margin, as discussed in 



Remark IV. 5 The continuous field is defined over a 690 m by 
690 m square, and was approximated using a 32 x 32 grid. 
For all points q g Q we set the consumption rate c(q) = 1, 
and the production function is shown in Figure [7] Robot 1 
followed a figure-eight path which has a length of 2630 m, 
while robot 2 followed a four-leaf clover path with a length of 
2250 m. The footprint for robot 1, the higher (yellow) robot 
had a radius of 100 m, and the speed constraints were given 
by i'min, 1 = 1-5 m/s and w max .i = 15 m/s. The footprint 
for robot 2, the lower (red) robot, had a radius of 133 m, 
and the speed constraints were given by i> m in,2 = 2 m/s and 
^max,2 = 20 m/s. The cycle time for robot 1 was T\ = 519 s, 
and the cycle time for robot 2 was T2 = 443 s. A total of 150 
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Fig. 6: An example of using a discretized approximation of a continuous field for two robots. The grid is 32 X 32, and the field Z(q, t) at each point is 
shown as the surface. The footprint of the higher robot (yellow) is 4/3 the radius of the footprint of the lower (red) robot. The snapshot sequence goes from 
left to right with the left snapshot showing the initial condition of the robots and field. 




Fig. 7: The production function p(q) for the simulations in Figures [8] and |6] 



rectangular basis functions were used for each robot's speed 
controller. The optimization was solved in approximately 10 
seconds. A snapshot for a simulation with three robots is 
shown in Figure [T] In this case, the green robot flying at a 
higher altitude has a square footprint which is oriented with 
the robot's current heading. 

A. A Case Study in Robustness 

In this subsection we demonstrate how we can compute 
a speed controller that exhibits robustness to motion errors, 
modeling uncertainty, or stochastic fluctuations. This is impor- 
tant since the speed controller is computed offline. It should 
be noted, however, that in practice the controller can be 
recomputed periodically during the robot's execution, taking 
into account new information about the field evolution. 



As shown in Corollary 111.2 we can maximize stability 
margin of a speed controller. The corollary showed that in 
maximizing this metric we obtain some robustness to error. To 
explore the robustness properties of this controller, consider 
the single robot example in Figure [8] In this example, the 
square 665 m by 665 m environment must be monitored by 
an aerial robot. We approximated the continuous field using 
a 32 x 32 grid. The consumption rate was set to c(q) = 5, 
for each field point q e Q. The production rate of the field 
was given by the function shown in Figure [7] The maximum 
production rate of the field was 0.74 and the average was 0.21. 



The robot had a circular footprint with a radius of 133 m, a 
minimum speed of v m \ n — 1.5 m/s and a maximum speed of 
« max = 15 m/s. The path had a cycle length of 4200 m. If 
the robot followed the path at constant speed, then 80 of the 
points would be unstable. 

For the speed controller, we used 280 rectangular basis 
functions, and solved the optimization as described in Corol- 
lary III. 2 resulting in a stability margin of B — 97.8. The 



optimization was solved in approximately 10 seconds. The 
time for the robot to complete one cycle of the path using this 
controller was T = 439 s. 

Stochastic field evolution: Now, suppose that we add zero- 
mean noise to the production function. Thus, the robot based 
its speed controller on the "nominal" production function 
p(q) (shown in Figure [7J, but the actual production function 
is p(q, t) — p(q) + n(t, q), where n(t, q) is noise. For 
the simulation, at each time instant i, and for each point 
q G Q, we independently draw n(t, q) uniformly from the 
set [— n max , n max ], where n max > is the maximum value of 
the noise. 

The simulations were carried out as follows. We varied the 
magnitude of the noise n max , and studied the maximum value 
reached by the field. For each value ?i max , we performed 20 tri- 
als, and in each trial we recorded the maximum value reached 
by the field on a time horizon of 2500 s. In Figure [9] we 
display statistics from the 20 independent trials at each noise 
level, namely the mean, minimum, and maximum, as well as 
the standard deviation. With zero added noise, one can see that 
the mean, minimum, and maximum all coincide. As noise is 
added to the evolution, the difference between the minimum 
and maximum value grows. However, it is interesting to note 
that while the performance degrades (i.e., the mean increases 
with increasing noise), the system remains stable. Thus, the 
simulation demonstrates some of the robustness properties of 
the proposed controller. 

Parameter errors: We can also consider robustness to 
parameter errors. In particular, consider the case where the 
robot bases its optimization on a production rate of p(q) 
(shown in Figure |7J, but the actual production rate is given 
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Fig. 8: An example of using a discretized approximation of a continuous field. The field Z(q, t) at each point is shown as the surface. The footprint of the 
vehicle is a disk, and the robot's trajectory is given by the "six leaf" clover. The time sequence of the three snapshots goes from left to right. In the rightmost 
snapshot the vehicle has just finished reducing the large peak that forms in the left and center snapshots. 




Magnitude of noise added to production rate 



Fig. 9: The robustness of the speed controller to noise in the production rate. 
For each value of noise n ma x, the plot shows statistics on the maximum value 
reached by the field over 20 independent trials. 





Fig. 10: The robustness of the speed controller to tracking errors. The left 
figure shows the absolute tracking error in meters. In the right figure, the 
solid line is the desired path, and the dashed line is the path executed by the 
unicycle robot. 



by p(q) = p(q) + e> where e > (note that the field 
is trivially stable for any e < 0). By maximizing the sta- 
bility margin, we obtain some level of robustness against 
this type of parameter uncertainty. The amount of error that 
we can tolerate is directly related to the stability margin 
B = 97.8, as shown in Corollary |III.2| In particular, we obtain 

e < £c(q«)/(E"=i a 3 Jo Pj( S W) = °- 074 - We performed 
simulations of monitoring task for successively larger values 
of e. From this data, we verified that the field remains stable 
for any e < 0.07. For this example, the average value of p(q) 
over all points q is 0.21, and so we can handle uncertainty in 
the magnitude of the production rate on the order of 30%. 

Tracking error: After running the speed optimization, a 
robot has a desired trajectory, consisting of the pre-specified 
path and the optimized speed along the path. In practice, this 
trajectory will be the input to a tracking controller, which takes 
into account the robot dynamics. Since there are inevitably 
tracking errors, the stability margin of the controller is needed 
to ensure field stability. As an example, we considered a 
unicycle model for an aerial robot. In this model, the robot's 
configuration is given by a heading <fi and a position (x, y). The 
control inputs are the linear and angular speeds: x — v cos (f>, 
y = usin0, and 4> — uj. The linear speed had bounds of 
v m in — 1-5 m/s and w max = 15 m/s, and the angular speed 



was upper bounded by 0.5 rad/s. We used the same speed 
controller as in the previous two examples (maximizing the 
stability margin). For trajectory tracking, we used a dynamic 
feedback linearization controller |39|. We chose conservative 
controller gains of 0.5 for the proportional and derivative 
control in order to accentuate the tracking error. The results 
are shown in Figure[l0] Due to the stability margin of 97.8, the 
field remains stable, even in the presence of this tracking error. 
However, in simulation, the maximum field height increased 
by about 13% from 268 (as shown for the zero noise case in 
Figure [9} to 305. 

VI. Conclusions, Extensions, and Open Problems 

In this paper we proposed a model for persistent sweeping 
and monitoring tasks and derived controllers for robots to 
accomplish those tasks. We specifically considered the case 
in which robots are confined to pre-specified, closed paths, 
along which their speed must be controlled. We formulated 
an LP whose solution gives speed controllers that keep the 
accumulation bounded everywhere in the environment for 
single robots and multiple robots. For single robots, we also 
formulated a different LP to give the optimal controller — the 
one that keeps the accumulation function as low as possible 
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everywhere. We see this as a solution to one kind of persistent 
task for robots, but many open problems remain. 

A. Extensions and Open Problems 

We are interested in the general problem of solving per- 
sistent tasks, which we broadly define as tasks that require 
perpetual attention by a robotic agent. The main objective 
of a persistent task is to maintain the accumulation of some 
undesirable quantity at as low a value as possible over an en- 
vironment using one or multiple robotic agents. The difficulty 
of this problem depends on what is known by the robots, and 
precisely what the robots' capabilities are. Let us enumerate 
several possible dimensions for extension on this problem. 

• Trajectory vs. Path vs. Speed One might consider 
controlling only the speed over a prescribed path, as we 
have done in this paper, only the path with a prescribed 
speed, or complete trajectory planning. 

• Single vs. Multiple Robots There may be only one 
robot, or there may be a team of robots, potentially with 
heterogeneous capabilities and constraints. 

• Known vs. Unknown Production Rate The robot may 
know (or be able to sense) the production rate, or it may 
have to learn it over time from measurements. 

• Constant vs. Time-Varying Production Rate The pro- 
duction rate may be constant in time, or it may change 
indicating a change in the environment's dynamics. 

• Finite vs. Continuum Points of Interest The points of 
interest in the environment may be viewed as a finite set 
of discrete points over which to control the accumulation, 
or as an infinite continuum in which we would like to 
control the accumulation at every point. 

In this paper we specifically considered speed control over a 
prescribed path of both single and multiple robots in a finite 
environment with a known, constant production rate. 

One interesting direction in which to expand this work 
is to consider planning full trajectories for robots to carry 
out persistent tasks. The high dimensionality of the space of 
possible trajectories makes this a difficult problem. However, 
if the robot's path is limited by some inherent constraints, 
then this problem may admit solutions with guarantees. For 
example, underwater gliders are commonly constrained to take 
piecewise linear paths, which can be parametrized with a low 
number of parameters. Another direction of extension is to 
have a robot solve the LP for its controller on-line. This would 
be useful if, for example the production rate is not known 
before hand, but can be sensed over the sensor footprint of the 
robot. Likewise if the production rate changes in time, it would 
be useful for a robot to be able to adjust its controller on-line 
to accommodate these changes. A promising approach for this 
is to repeatedly solve for the LP in a receding horizon fashion, 
using newly acquired information to update the model of the 
field evolution. We continue to study problems of persistent 
tasks for robots in these and other directions. 

Appendix 

Periodic Position-Feedback Controllers 



Consider a general speed controller (Z, IC,t) i->- 
v(6,Z,IC,t) where IC := (Z(q, 0), 6(0)) is the set of initial 
conditions. Since v m i n (9) > for all 9 £ [0, 1], the value 
of 9 strictly monotonically increases from to 1 for every 
valid controller (once it reaches 1 it then wraps around to 
0). In addition, the evolution of Z is deterministic and is 
uniquely determined by the initial conditions and the speed 
controller, as given in ([TJ. Because of this, every controller of 
the form v(9, Z, IC, t) can be written as an infinite sequence of 
controllers i'i(9, IC),V2(9, IC), . . ., where controller v k (6, IC) 
is the controller used during the kth period (or cycle). 

With the above discussion in place, we are now ready to 



prove Proposition II.4 



Proof of Proposition \II.4\ To begin, consider a feasible 
persistent monitoring task and a field stabilizing controller of 
the form v(6,Z,lC,t), where IC := (Z(q, 0), 0(0)). Without 
loss of generality, we can assume that 9(0) :— 0. From 
the discussion above, we can write the general controller 
as a sequence of controllers V\(6, IC), 1)2(0, IC), . . ., where 
controller v k (9,lC), k 6 N is used on the kth period (or 
cycle). 

Since the controller is stable, there exists a Z max 
such that for every set of initial conditions IC, we have 
limsup t _ i . +00 Z(q, t) < Z max . Let us fix e > and fix the 
initial conditions to a set of values IC such that Z(q, 0) > 
Z max + e for all q £ Q. Now, we will prove the result by 
constructing a periodic position-feedback controller. 

Let t\ = and define 

r-l 



tk — tk-1 



1 



-de, 



/o v k (9,IC) 

for each integer k > 2. Thus, controller Vk is used during the 
time interval [tfe, tfc+i]. Following the same argument as in the 
proof of Lemma |II.5| we have that for each q £ Q, 



Z(q, t k ) - Z(q, h) > p(q) £ f — i 



IC) 



-de 



tri Jf(o) v t (9, 



-de. 



In this appendix we will prove Proposition II.4 on periodic 
position-feedback controllers. 



^Jf^v^OJC) 

Now, Z(q, t\) > Z max + e is the initial condition, and 
limsup f _j. +00 Z(q, t) < Z max . Thus, for every fixed 5 £ 
(0, e), there exists a finite k such that Z(q,t k ) < 2 max + 5. 
Since this must hold for every q £ Q, we see that there exists 
an integer k such that 

Kq) E f -ih^ de ~ c(q) £ / -7Fw^ de < °' 

^Jo v e (9,lC) J F (q) v e (9,IG) 

for every q £ Q. Rearranging the previous equation we obtain 

k 1 k 

JF(i)jri v e(9> lc ) Jo £^v e (0,IC) 

(24) 

Therefore, let us define the periodic controller 

v (e) = k\Y /-J » 
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for each 9 e [0, 1]. Note that ifv min (9) < v t (0, IC) < v max (0) 
for all 6 E [0, 1] and all t e N, then 

^min 

(6) < v(0) < (9). 

But, combining the definition of v(9) with ( |24| i, we im- 
mediately see that v(9) satisfies the stability condition in 



Lemma II. 5 and thus v{9) is a field stabilizing position- 



feedback controller. 
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